Discretization of Multidimensional Web Data for Informative Dense Regions Discovery

Authors

  • Edmond HaoCun Wu
  • Michael K. Ng
  • Andy M. Yip
  • Tony F. Chan
Abstract

Dense regions discovery is an important knowledge discovery process for finding distinct and meaningful patterns in given data. The challenge in dense regions discovery is how to find informative patterns in the various types of data stored in structured or unstructured databases, for example when mining user patterns from Web data. Novel approaches are therefore needed to integrate and manage these multi-type data repositories in support of new-generation information management systems. In this paper, we first discuss and propose several discretization methods that are suitable for multidimensional Web data. Building on these methods, we demonstrate some dense regions discovery applications using Web usage data from a real Website. The experiments show that the discretization methods are effective and efficient, especially for high-dimensional data. They also suggest that the discretization methods can be used in other practical Web applications, such as user pattern discovery.
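
The abstract does not say which discretization methods the paper proposes. Purely as an illustration of the general idea, the sketch below uses equal-width binning (a standard technique, not necessarily one of the paper's methods) to map each dimension of some made-up session records onto a small grid, then reports the grid cells that contain at least a minimum number of records. The function names, the sample data, and the parameters `n_bins` and `min_count` are all assumptions introduced here.

```python
from collections import Counter


def equal_width_bins(values, n_bins):
    """Map each numeric value to a bin index in [0, n_bins - 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so everything falls in one bin.
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # min(...) keeps the maximum value inside the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def dense_cells(records, n_bins, min_count):
    """Discretize every dimension, then keep grid cells with >= min_count records."""
    columns = list(zip(*records))
    binned_columns = [equal_width_bins(col, n_bins) for col in columns]
    cells = Counter(zip(*binned_columns))
    return {cell: count for cell, count in cells.items() if count >= min_count}


# Hypothetical Web sessions: (pages viewed, seconds on site).
sessions = [(2, 30), (3, 45), (2, 40), (20, 900), (19, 880), (3, 35)]
result = dense_cells(sessions, n_bins=3, min_count=2)
print(result)
```

In this toy data the short sessions and the long sessions fall into two separate grid cells, which is the kind of grouping a dense-region search over discretized data is meant to surface. A real pipeline would need a more careful choice of bin boundaries and density threshold.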

Similar resources

Expert Discovery: A web mining approach

Expert discovery is the search for an answer to the question: “Who is the best expert on a specific subject in a particular domain, within a particular set of parameters?” An expert with domain knowledge in any field is crucial for consulting in industry, academia and the scientific community. The aim of this study is to address the issues of the expert-finding task in a real-world community. Collabor...

Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Industrial-scale R&D IT projects depend on many sub-technologies, which need to be understood and have their risks analysed before the project begins if it is to succeed. When planning such an industrial-scale project, the list of technologies and the associations among them are often complex and form a network. Discovery of this network of technologies is time consumi...

Color Reduction in Hand-drawn Persian Carpet Cartoons before Discretization using image segmentation and finding edgy regions

In this paper, we present a method for color reduction of Persian carpet cartoons that increases both the speed and the accuracy of editing. Carpet cartoons fall into two categories: machine-printed and hand-drawn. Hand-drawn cartoons are divided into two groups: before and after discretization. The purpose of this study is color reduction of hand-drawn cartoons before discretization. The proposed algorit...

Foundation of Semantic Oriented Data and Web Mining

An Attribute-Oriented Approach for Knowledge Discovery in Rough Relational Databases (Theresa Beaubouef, Frederick E. Petry, Roy Ladner); Toward a Theory of Self-Reproducing Temporal Database Systems (James Kuodo Huang, T. Y. Lin); Fast and Memory Efficient Algor...

Microscopic Structures Analysis and Experimental Research of Beak

To reveal the mechanism behind easy discretization and low damage in kernel dispersal, this paper analyzes the microscopic structure of the beak and finds that the outer maxillary cells of the beak are dense and hard. In addition, the cuticle wrapping the maxilla of a chicken's beak can reduce damage to corn kernels when the ear is discretized. From force tests on corn ears, we find that value of x ...

Publication date: 2004